Speech Transcript Evaluation for Information Retrieval

نویسندگان

Laurens van der Werff

Wessel Kraaij

Franciska de Jong

چکیده

Speech recognition transcripts are being used in various fields of research and practical applications, putting various demands on their accuracy. Traditionally ASR research has used intrinsic evaluation measures such as word error rate to determine transcript quality. In non-dictation-type applications such as speech retrieval, it is better to use extrinsic (or task specific) measures. Indexation and the associated processing may eliminate certain errors, whereas the search query may reveal others. In this work, we argue that the standard extrinsic speech retrieval measure average precision is unpractical for ASR evaluation. As an alternative we propose the use of ranked correlation measures on the output of the speech retrieval task, with the goal of predicting relative mean average precision. The measures we used showed a reasonably high correlation with average precision, but require much less human effort to calculate and can be more easily deployed in a variety of real-life settings.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Retrieval under Limited Resources and Open Domain Conditions

Speech Retrieval focuses on retrieving a segment of speech from a speech corpus correspond to a given query. A standard Speech Retrieval system usually composed by two systems, the Automatic Speech Recognition (ASR) system and the Information Retrieval (IR) system. The ASR system transcribes the speech and represents the transcript in different formats. The transcript is then indexed and search...

متن کامل

Accessing speech data using strategic fixation

When users access information from text, they engage in strategic fixation, visually scanning the text to focus on regions of interest. However, because speech is both serial and ephemeral, it does not readily support strategic fixation. This paper describes two design principles, indexing and transcript-centric access that address the problem of speech access by supporting strategic fixation. ...

متن کامل

Experiments in Spoken Document Retrieval at CMU

We describe our submission to the TREC-6 Spoken Document Retrieval (SDR) track and the speech recognition and the information retrieval engines. We present SDR evaluation results and a brief analysis. A few developments and experiments are also described in detail including: • Vocabulary size experiments, which assess the effect of words missing from the speech recognition vocabulary. For our 5...

متن کامل

Multimedia Retrieval in MultiMatch: The Impact of Speech Transcript Errors on Search Behaviour

This study discusses the findings of an evaluation study on the performance of a multimedia multimodal information access sub-system (MIAS), incorporating automatic speech recognition technology (ASR) to automatically transcribe the speech content of video soundtracks. The study’s results indicate that an information-rich but minimalist graphical interface is preferred. It was also discovered t...

متن کامل

Speech recognition in the Informedia Digital Video Library: uses and limitations

In principle, speech recognition technology can make any spoken data useful for library indexing and retrieval. This paper describes the Informedia Digital Video Library project and discusses how speech recognition is used for transcript creation from video, alignment with handgenerated transcripts, query interface and audio paragraph segmentation. The results show that speech recognition accur...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2011

Speech Transcript Evaluation for Information Retrieval

نویسندگان

چکیده

منابع مشابه

Speech Retrieval under Limited Resources and Open Domain Conditions

Accessing speech data using strategic fixation

Experiments in Spoken Document Retrieval at CMU

Multimedia Retrieval in MultiMatch: The Impact of Speech Transcript Errors on Search Behaviour

Speech recognition in the Informedia Digital Video Library: uses and limitations

عنوان ژورنال:

اشتراک گذاری